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Abstract 

This paper investigates control laws allowing mobile, autonomous agents to optimally position 
themselves on the line for distributed sensing in a nonuniform field. We show that a simple static 
control law, based only on local measurements of the field by each agent, drives the agents close to 
the optimal positions after the agents execute in parallel a number of sensing/movement/computation 
rounds that is essentially quadratic in the number of agents. Further, we exhibit a dynamic control 
law which, under slightly stronger assumptions on the capabilities and knowledge of each agent, 
drives the agents close to the optimal positions after the agents execute in parallel a number of 
sensing/communication/computation/movement rounds that is essentially linear in the number of agents. 
Crucially, both algorithms are fully distributed and robust to unpredictable loss and addition of agents. 

I. INTRODUCTION 

Widespread deployment of networks of sensors and autonomous vehicles is expected to 
revolutionize our ability to monitor and control physical environments from remote locations. 
However, for such networks to achieve their full range of applicability, they must be capable of 
operating in uncertain and unstructured environments without centralized supervision. Realizing 
the full potential of such systems will require the development of protocols that are fully 
autonomous, distributed, and adaptive in the face of changing environments. 

An important problem in this context is the coverage problem: a network of mobile sensors 
should distribute itself over a region with a measurable field so that the likelihood of detecting 
an event of interest is maximized. If the probability distribution of the event is uniform over 
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the area, then the optimal solution will involve a uniform spatial distribution of the agents. On 
the other hand if this probability distribution is not uniform, then the sensors should be more 
densely positioned in the subregions that have higher event probability. 

There has been considerable recent interest in the coverage problem, in part prompted by 
applications to the design of robotic networks for search and rescue, environmental monitoring, 
automatic surveillance, cooperative estimation, and other tasks (see survey below). However, 
it is still unknown how to design decentralized control laws for achieving optimal coverage 
which scale optimally or nearly-optimally with the number of agents in the network. In this 
paper, we do this for the nonuniform coverage problem in the case the agents are arranged on a 
line. Moreover, we establish that relatively minor increases in the capabilities of each agent can 
lead to an order of magnitude improvement in the convergence rate of the system in achieving 
coverage. 

A. Related work 

There is a considerable literature on coverage algorithms for groups of dynamic agents; we 
refer the reader to O, 01, El, E3, El, ED, EOl, 0, ID, ED, El, O, ED, El, EI, 
H291 11231 , 11261 , Bl, 0, (SI, HH, 0, [(3D, E21, E3, EH, S and the references therein. Here, 
we only survey the papers most closely related to our work. 

In [O, uniform coverage algorithms are derived using Voronoi cells and gradient laws for dis- 
tributed dynamical systems. Uniform constrained coverage control is addressed in ll24l where the 
constraint is a minimum limit on node degree. Virtual potentials enable repulsion between agents 
to maximize coverage and attraction between agents to enforce the constraint. In BUI , gradient 
control laws are proposed to move sensors to a configuration that maximizes expected event 
detection frequency. Local rules are enforced by defining a sensing radius for each agent, which 
also makes computations simpler. The approach is demonstrated for a nonuniform but symmetric 
density field with and without communication constraints. Further results for distributed coverage 
control are presented in [ Toll for a coverage metric defined in terms of the Euclidean metric with 
a weighting factor that allows for nonuniformity. As in El, the methodology makes use of 
Voronoi cells and Lloyd descent algorithms. The papers El, flU identified a class of non-convex 
regions for which the coverage problem may be solved by reduction to the convex case through 
a well-chosen transformation of the region. The papers El, El explored an optimization-based 



approach to some complex variations of the covering problem. 

The paper ll22ll considered the general nonuniform coverage problem with a non-Euclidean 
distance, and it proposed and proved the correctness of a coverage control law in the plane. 
However, the control law of Il22ll is only partially distributed, in that it relies on a "cartogram 
computation" step which requires some global knowledge of the domain. 

B. The coverage problem on the line 

We will consider a particular case of the nonuniform coverage problem when the agents are 
arranged on a line. There are two main reasons for our interest in this question. First, while 
this is arguably the simplest model within which one can study the coverage problem, even in 
this setting many of the most fundamental questions are open: for example, it is unclear how 
to design control laws for which the time taken or energy expended scales optimally with the 
number of agents. 

Secondly, the line coverage problem is motivated by several natural applications. One is 
the problem of border patrol, in which a collection of mobile sensors must adaptively spread 
themselves out over a one-dimensional curve; this is easily seen to be equivalent to the problem 
we consider here after a re-parametrization. Boundary tracking is a problem analogous to border 
patrol; in this case mobile sensors should adapt their positions to monitor the boundary of a 
crowd of people, a toxic spill, a forest fire, etc. Another application is adaptive sampling in two 
and three-dimensional environments where the nonuniformity in the sampled field is dominant 
in one dimension. For instance, in the air or in the water, nonuniform coverage in the vertical 
is a typical concern: vertical profilers or autonomous vehicles should distribute themselves over 
depth or altitude, while vertical lines of these sensors can be deployed in the horizontal plane in 
a static array or adapted using a distributed uniform coverage algorithm. An important example 
in the vertical water column (oceans, lakes, reservoirs, rivers) is mapping the nonuniform vertical 
temperature field and locating the thermocline, a thin layer that separates the warm surface layer 
from the cold deep layer; the depth of the thermocline is relevant in many applications, for 
instance, it is the depth at which the speed of sound is maximal. 



C. Our contributions 

Our work builds on the results of [|22| to design a control law for the nonuniform coverage 
problem in the one-dimensional case when the agents are positioned on the line. We develop fully 
distributed coverage control laws for a nonuniform field in this setting, and moreover, we prove 
quantitative convergence bounds on the performance of these algorithms. Interestingly, we find 
that relatively modest increases in the capabilities and knowledge of each agent can translate 
into considerable improvements in the global performance. These improvements are obtained 
by implementing more sophisticated variants of distributed algorithms; in particular, we make 
significant use of the technique of lifting Markov chains, introduced in ffT2ll . and subsequently 
used to accelerate distributed computation in [fT9l , I128TD . 

We begin with an introduction to the nonuniform coverage problem in Section II. In Section 
III, we present our first fully distributed control law for the coverage problem. The execution of 
this control law only requires the agents to be able to measure distances to their neighbors and 
to measure the field around their location. The main result of this section is Theorem 1, which 
demonstrates the correctness of the algorithm and gives a quantitative bound on its performance. 
We show that it takes n agents essentially on the order of n 2 parallel update rounds consisting 
of sensing, computation, and movement to come close to the optimal configuration regardless 
of the initial conditions. 

In Section IV, we present another fully distributed control law for coverage. The execution of 
this control law requires more capabilities on the part of the agents: they store several numbers 
in memory, communicate these numbers to their neighbors at every round, and moreover, they 
know approximately (within a constant factor) how many agents there are in total. Subject to 
these assumptions, we derive a considerable improvement in the number of update rounds over 
the simple static control law of Section III. The main result of this section is Theorem 6, which 
demonstrates the correctness of the algorithm and gives a quantitative bound on its performance. 
We show that it takes n agents essentially on the order of n parallel update rounds consisting of 
sensing, computation, communication, and movement to come close to the optimal configuration 
regardless of initial conditions. This is an order of magnitude improvement over the control law 
of Section III. 



II. NONUNIFORM COVERAGE 

We introduce the nonuniform coverage problem in this section; our exposition closely follows 
the expositions of ll22ll . J9]|. We consider n mobile agents initially situated at arbitrary positions 
xi(0), £2(0), . . . , x n (0) which, for simplicity, we henceforth assume to be located in the interval 
[0,1]. There is a positive, piecewise-continuous function p : [0,1] — > (0, 00), which measures 
the density of information or resource at each point. We assume that p is bounded both above 
and below, i.e., there exist positive numbers p max and p min such that for all z G [0,1], we have 
Pmin < p{z) < Pmax- The goal is to bring the agents from their initial configuration to a static 
configuration that allows them to optimally sense in the density field p. Intuitively, we would 
like more agents to be positioned in areas where p is high, and fewer agents positioned in areas 
where p is low. 

More formally, for a, b E [0,1] we define the metric 

i"max(a,6) 

d p (a, b) — p(z)dz. 

J min(a,6) 

It is easy to see this defines a valid metric between points in [0, 1]. Relative to the ordinary 
distance \a — b\, this metric expands regions where p is large and shrinks regions where p is 
small. 

Following B221, we define the coverage of a set of points x\, . . . ,x n relative to the density 
field p as 

. . . , x n , p) = max min dJy,Xi). 

1/6 [0,1] i=l,—,n 

Given the positions xi,...,x n of the agents and the density field p, computing $ requires 
computing the distance d p from any point in [0, 1] to the closest x«. The coverage metric $ is 
then the largest of these distances. A smaller $ implies the vehicles achieve better coverage of 
the domain [0, 1]. We use <3>* to denote the best (smallest) possible coverage 

$* = inf $(m,...,x n ,p). 

(x 1 ,...,x n )e[o,i} n 

It is not hard to see (and we will formally show later) that this infimum is reached at a point 
where no two of the Xi are the same. In this paper, we are concerned with designing control 
laws which drive agents towards positions with coverage <3>*. 

As pointed out in ll22ll . one may alternatively interpret the coverage problem in terms of 
optimal positioning with a nonuniform distance; this is closely related to information gathering 



and sensor array optimization problems. A typical problem is to minimize shortest response 
time from a collection of vehicles to any point in a terrain of varying roughness. In that case, 
the non-Euclidean distance d p appears because rougher bits of terrain take longer to traverse. 
Another such problem is the detection of acoustic signals; the objective is to place sensors so 
they can detect a source anywhere in an inhomogeneous medium. In that case, the non-Euclidean 
distance d p appears as a result of the spatially varying refractive index of the environment. 

III. A STATIC COVERAGE CONTROL LAW 

We now describe and analyze a simple distributed control law that drives the vehicles towards 
optimal coverage. First, we need to define the notion of a p-weighted median between points. 
Definition: The a-median m p (a : b) is defined as the point c G (a, 6) which satisfies 

c pb 

p(z)dz = a p(z)dz. 

Due to the strict positivity of p, it is easy to see that a unique such point exists for any a > 0. 

We can now state the coverage control law. We assume for convenience that agents are labeled 
l,...,n from left to right. This makes it easier to state what follows; however, the actual 
implementation of the algorithm does not require the use of these labels. 

A static coverage control law: the agents iterate as 

x 1 (t + l) = m 1 J 2 (0,x 2 (t)) 

Xi(t + 1) = mj(xj_i(i),x m (t)), i = 2,...,n-l 
x n (t + l) = m 2 p {x n -i(t),l) 

We first briefly outline how this scheme may be implemented without knowledge of the labels 
1, . . . , n by the nodes. A node i will initially check whether it has a left neighbor and a right 
neighbor, or whether it is a "border agent" with a single neighbor. Suppose it has two neighbors. 
Then, it will measure the distance d L to its left neighbor and d R to its right neighbor, and denoting 
its position (which it does not know) by Xi, it will measure p in the interval [xi — d L , Xi + d K }. 
This gives it enough information to compute the 1 -median of the positions of its neighbors, 
and it moves to this location. "Border agents" can similarly implement this control law without 
knowledge of their labels. 



Next, we remark that this scheme may be interpreted as a distributed implementation of the 
cartogram approach introduced in 11221 specialized to the line. Intuitively, each of the middle 
nodes 2,...,n — 1 seeks to position itself "in the middle" of its neighboring agents while 
stretching areas with high p and shrinking areas with low p; this is precisely the distributed 
computation of the cartograms used in ll22l . 

Our goal in this section is to prove that this iteration solves the coverage control problem 
and to provide quantitative bounds on its performance. The main result of this section is the 
following theorem. 

Theorem 1: Each Xi(t) has a limit, and the limiting set of positions have coverage $*. Moreover, 
after 0(n 2 log(^y^)) rounds, each agent is within e of its final limit. 

We next turn to the proof of this theorem. We first write down the optimality conditions for 
achieving $*. 
Lemma 2. The equations 



location x\ G (0, 1) satisfying d p (0,xi) = d p (0, l)/(2n). Such a location clearly exists. Next, for 
every i > 1, we position the z'th agent at the location xi which satisfies d p (xj_i, Xi) = d p (0, l)/n. 



2d p (0,x 1 ) = d p (x 1: x 2 ) = ■■■ d p (x n -i,x n ) = 2d p (x ni 1) 



(1) 



have a unique solution which achieves coverage $*. Moreover, 




Proof. It is easy to see that Eq. ([T]) has a solution. Indeed, suppose we position agent 1 at the 



Again, such a point clearly must always exist. The resulting locations satisfy Eq. 
Moreover, this solution is unique because together with the obvious identity 



n-l 




i=l 



Eq. ([T]) implies 



d p (Q, 1) = d p (0, xx)(l + 2(n - 1) + 1) 



which immediately fixes the value of x\, and hence all the values Xi. 



Under this configuration the largest distance d p between any point in [0, 1] and its closest 
agent clearly equals d p (0, l)/(2n). We next argue that $* is at least this big. Consider any set 
of positions x\ < x 2 < ■ ■ ■ < x n . Since 

d p (0, 1) = [d p (xi,x 2 )] + [d p (x 2 ,x 3 )] H h [d p (x n -i,x n )] + [d p (0, xi) + d p (x n , 1)] 

and there are n terms in brackets, we can conclude that at least one of the bracketed terms is at 
least d p (0, l)/n. 

If the only such term is the last term, then one of d p (0,xi), d p (x n ,l) has value at least 
d p (0, l)/(2n). But this implies that the distance d p from either or 1 to the closest agent is at 
least d p (0, l)/(2n), which proves that in this case $* is at least that big. 

If, on the other hand, it is some term d p (xi, x i+ i) which is at least d p (0, l)/n, then the distance 
dp from the median m p (xi, x i+1 ) to the closest agent is at least d p (0,l)/ (2n), which again implies 
$* is at least this much, q.e.d. 

We next introduce a change of variables which makes our static control law easier to analyze. 
We define 

F{x) = I p(z)dz, 
Jo 

and note that F(l) = d p (0, 1). Moreover, for any two points a < b in [0, 1], 

d p (a,b) = F(b) -F(a), 

and 

FKW))^». (2) 
Remark: We note that under the execution of the static control law, we have that for all t, 

x x {t) <x 2 (t) <---<x n (t) 

That is, the execution of the static control law preserves the ordering of the agents. To show 
this it suffices to demonstrate that for all i,t, 

F(xi(t)) < F(x i+l (t)) 

which follows by induction from Eq. and the algorithm defintion. 

The following simple corollary of Lemma 2 describes how the nonuniform coverage problem 
may be made uniform by a transformation and connects our approach to the "nonuniform 
cartogram" approach of fl22j. 



Corollary 3: The positions x 1: . . . , x n achieve optimal coverage $* if and only if F(xi), . . . , F(x n ) 
achieve optimal coverage in [0,F(1)] with density equal to 1. 

The next lemma restates our coverage control law in a particularly convenient form. 
Lemma 4: Assuming there are at least two agents, let us define 

do(f) = 2F( Xl (t)) 

di(t) = F(x i+1 (t)) - F( Xi (t)), i = 2,...,n-l 
d n (t) = 2(F(1) - F(x n (t))) 

and let d(t) be the vector in W l+1 which stacks the variables dj. Finally, we define 



^-4 4 ^ 



U« = 



V 
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and for n > 5, 



/ -4 4 

2-5 3 

3-6 3 



t4 = 



\ 



V 

Then d(t) follows the dynamics 



3-6 3 

3-5 2 
4 -4 



d(t + l) = (I+-U n+1 )d(t). 



Proof: We give the calculation in the case of n > 4; the proof of the cases n = 2, 3 are similar. 
Define yi(t) = F(xi(t)). From Eq. we can conclude that the variables y^t) evolve as 

yi{t) 



yi{t + i) 



3 



*<t + U = y - l(t) ^ M " ) . i = 2,...,n- l 

We can rewrite the variables d^t) in terms of the variables yi(t) as 

d (t) = 2 Vl (t) 

diit) = Vi+iif) - yi(t), i = 2,...,n-l 

d n (t) = 2(F(1) - y n (f)) (3) 
yi (t + l) = Vl (t) + 



Observe that 



di(t) 


- do(t) 




3 


di(t) - 






2 


d n (t) 


- d n _i(t) 



+ = yi(t)+ — 2 w i = 2,...,n-l 

/. , -,\ us i d n {t) - d n _i(t) 
y n (t + 1) = j/„(t) H (4) 

and replacing i by t + 1 in Eq. ([3]) and then plugging in Eq. Q 

2{d 1 {t) - d {t)) 



d (t + 1) - d (t) 
di(t + l) -rfi(t) 



3 

d 2 (t)-d 1 (t) _ dt(t) -d (t) 
2 3 



d t (t + 1) - dl (t) = d^(t)-dM _ d i (t)-dUt) j z = 2) ... ;n _ 1 

, 1 4- \ i \ j t±\ _ d n (t) — d n -i(t) d n -i(t) — d n _ 2 (t) 
dn-iyt + i-j — a n -i\t) — — 



d n {t + 1) - d n (t) 



3 

2(d n (t) - d n ^(t)) 



or 



+ = -U n+1 d(t). 

o 



q.e.d. 



Lemma 5: Let k > 3 and let P k = I + £4/6. Then the spectrum of P k is real. Labeling it from 
smallest to largest as \{P) < • • • < A 2 (P) < Ai(P) = 1, we have 

max(|A fe (P)|,|A 2 (P)|)<l-^. 

Proof: For k = 3, 4 the lemma follows by an easy calculation. For general k > 5, consider an 
undirected line graph on fc nodes with self loops at each node. We assign weights to the edges 
as illustrated in Figure 1. Namely, the self loops on nodes 1, 2, k — 1, and k have weight 1; all 
other self-loops have weight zero. The edge between nodes 1 and 2, and the edge between node 
k — 1 and k each has weight 2. Every other edge has weight 3. We denote by Wij the weight 
of the edge between % and j (defined to be zero if there is no edge between % and j); since the 
graph is undirected, = Wji by definition. Moreover, we define Wi to be the sum of all the 
weights incident on node i, i.e. Wi = YTj=\ w ij- Thus, {w\, w 2 , ■ ■ ■ , w n , w n+ ±) = (3, 6, . . . , 6, 3). 
With these definitions in place, we observe that for all = w^jwi. 

110 Oil 

o o o o o 









Fig. 1. The weighted graph capturing the transition matrix P k for k > 5: Pij = w^/wj where Wij is the weight of the edge 
and Wi is the sum of all the weights incident on node i. 

Define the inner product (x,y) = J2i=i w i x iyi- Then, 

n n 

(x,Py) = w ij x iVj = Yl w H x iV3 = ( Px >y), 

i,j=l i,j=l 

so that P is self-adjoint in this inner product, and in particular its spectrum must be real. 
Observing that the largest eigenvalue of P is 1 with the eigenvector of all ones, the Courant- 
Fisher theorem gives 

A 2 (-P) = max (x,Px) 

{x,l)=0,{x,x) = l 



max 2, WijXiXj 



lid 

Wi 



2 Ji WiXi=0,2_,i Wixf = l Z 



= 1 — min y, w ij( x i ~ x jY 



1- min ^ w l , i+ i(x l+ i - Xj) 2 . 

1=1 



fc-1 

We now lower bound the last term on the right hand side using a variation of the argument from 
[2T] . The minimum is achieved (since we are minimizing a continuous function over a compact 
set); use the notation y for the minimizer. Consider the vector with z'th entry WiUi. Let M be 
the index which maximizes Wiyf; without loss of generality, we may assume that yu > (if 
not, we can replace y by — y). Let m be the index of the smallest entry; we may also assume 
without loss of generality that m < M. Observe that the constraint £\ u>^ = implies that 
y m < while the constraint Yli w iy'i = 1 implies that the average value of Wiyf is l/k. This 
means 

wmVm > 1A 



or 



Vm 



> l/^w M k > 1/VEk. 



Thus 



Applying Cauchy-Schwarz, 



I M-l 

7= < Vm - Vm = ^2 Vi+1 ~ Vi - 



j M-l 

i=m 

and therefore, 

fc-1 M-i 2 i 

i=l i=m 

Putting it all together, this implies the desired bound on A 2 . 

A similar argument (which is also a variation of an argument from 112110 proves the bound 



for A fc . By the Courant- Fisher theorem 



Afc(-P) = min (x,Px) 

{x,x)=l 



min > wnXiXj 
>;. <■:■■■■ i .4^ 

-^-{{Xi + xj) -x t - Xj 

fc 

= -1+ min Wjj(xj + x 3 ) 2 + 2w iti x 2 

y . wix 2 =i ' — ^ 

^* ! 1<J 2=1 

fe-1 

= -1+ min ^ w l .i +1 (x i+1 - Xjf 

+ + 2x 2 + 2x\_ x + 2x\. 

where in the final step we can replace (x i+ i+Xi) 2 with (x i+ i— Xi) 2 since that flipping the sign 
of every other x^ does not change the constraint Yl,i w i x 'i- Next, as before let y be the vector 
achieving the optimum in the above expression, and let wmDm be the largest among all the 
numbers wiyi. As before, yu > 1/VQk. If M — 1 or M — 2, the above equation immediately 
proves the lemma. If M > 2 

< Vm = V2 + (V3 - Vz) H \-(vm -Vm-i), 



'6k 

so that by Cauchy-Schwarz 



1 " — i 

— < k{y 2 2 + - t/fc) 2 ), 



6k 

fc=2 

so that 

fe-i fe-i ^ 

min V"w M+ i(x i+ i-Xi) 2 +2xi+2x2+2a; 2 _ 1 +2x^ > 2 min xl + y2(x i+1 -Xi) 2 > — , 

which implies the corresponding bound on \ k . q.e.d. 

Proof of Theorem 1: Decompose d(0) in terms of the eigenvectors of P as 

n+l 
j=l 

where v { corresponds to eigenvalue Aj. We know that Vi — 1. Thus 

n+l 
i=2 



Using the fact that the eigenvectors v,i are orthogonal in the inner product (•,•), 
(d(t) - Cl l,d(t) - Cl l) < (1 - 3(n | 1)2 ) 2t (d(0) - cil,d(0) - cil) 

or 

- ci) 2 < ±(1 - 3(n ^ 1)2 ) 2f (^(0) - cil, d(0) - cil>. 

Observe that 

rt+l n+1 

(d(0) - cil, d(0) - cil) < (d(0), d(0)> < 6 ^ d?(0) < 6( J] d,(0)) 2 < 6F(1) 2 

i=l i=l 

so that 

* ) - Cl)2 - 2(1 -^ J?(1)2 

It follows that each rfj(t) has a limit (which equals ci), and this immediately implies that every 
Viit) has a limit, and therefore every Xi(t) has a limit. Most importantly, because every di(t) 
approaches the same limit, we can conclude that the optimality condition of Lemma 2 is satisfied 
in the limit, and the set of limiting positions achieves $*. 

Moreover, after |(n + l) 2 log(2F(l) 2 n 2 /e 2 ) iterations, each di(t) is within e/n of its final 
value, which implies every y^t) is within e of its final value. 

Now the static control law is invariant under scaling of p, so we can assume without loss of 
generality that we are dealing with p — p/p m - m , so that the minimum value of p is 1. Then, we 
conclude that after 3(n+ l) 2 log(\/2 p(z)dz n/e) iterations, every Xi(t) is within e of its final 
value. To conclude the proof, observe that p(z)dz < p max / p mm , and the upper bound in the 
theorem statement follows, q.e.d. 

Remark: Observe that the coverage control law we have presented in this section is naturally 
robust to addition and deletion of agents as well as changes in p. Indeed, as long as the density 
and the number of agents stop changing at some point, the algorithm is guaranteed to converge 
to the optimal configuration. An open problem is to prove performance bounds for this algorithm 
in the scenario when the number of agents and the density are continually changing. 

IV. A DYNAMIC COVERAGE CONTROL LAW 

In this section, we propose another control law for the nonuniform coverage problem on the 
line. Our main result is that this control law converges an order of magnitude better than the 



static control law of the previous section; in particular, it drives the agents close to the optimal 
positions in a number of communication/movement rounds that is essentially linear in n for each 
agent. This scaling is optimal in the sense that any control law requires at least linearly many 
discrete-time rounds of communication to drive agents close to their optimal positions. 

We draw heavily on the paper [fT2l . which described a fast "nonreversible Markov sampler" 
for sampling a uniform random number from {1, . . . , n}. The main technical component of our 
result is a modification of the results of [12J; specifically, we use the ideas of |[T2l to modify the 
nonreversible Markov sampler to handle the nonuniform sampling problem of coverage control. 

We make stronger assumptions on the capabilities and knowledge of each agent. We now 
assume that agents can store numbers in memory, transmit numbers to their neighbors, and 
can detect when their neighbors move to a new location. However, every node only stores 
and transmits/senses two numbers in each of the control law iterations, so the additional effort 
expended is not excessively onerous. Moreover, we assume that each agent has an estimate U 
of the total number of agents, and that this estimate is accurate within a constant factor c which 
does not depend on n: 



Finally, we assume for simplicity that the number of agents is at least 3, and consequently U > 3 
as well. 

We refer to the control law of this section as the "dynamic coverage control law," since 
in contrast with the control law of the previous section, the feedback law has dynamics of 
its own. This control law follows two steps: an initial measurement step and a subsequent 
communication/measurement/movement step. 

Dynamic control law: Nodes keep track of the variables Zi(t), Zi>(t), initialized in the first step 
as 



n 



<U < cn. 



(5) 



c 



1 



/ 

Jo 



mj (on (0) ,0:2(0)) 



zi(0) 



2 



p(z)dz 





1/2 



1/U 




(1/2X1-1/U) f \ (1/2X1-1/U) 

n-1 i n 



(1/2)(1-1/U) V" V i" 11 -' 



l/(2Ur\\ 1/(2U)\\ T 



1/U 




1/2 



Fig. 2. The Markov chain representing the update equations of the dynamic coverage control law. 



and Zj/(0) = Zj(0) for each i At each step, nodes transmit their variables Zi(t),Zi>(t) to their 
neighbors, and then set their values Zi(t + 1), Zi>(t + 1) to be linear combinations of their previous 
values and the values they have just received. The linear combination taken by each agent is 
derived from a rule based on Figure 2. Note that this figure contains nodes labeled 1, . . . , n and 
1', . • • ,n'. Node % sets Zi(t + 1) to be a linear combination of those values Zk(t) which have 
edges going from k to %\ the coefficient it puts in front of z k (t) is the label on the edge. The 
value of Zi'(t + 1) is determined in the same way. For example, agent 1 updates as 

Zl (t + i) = (1-^)^(0 + ^7^(0 
z'iit + 1) = ^(i-^)^(0 + ^i'(0- 



After updating their variables z^zy, some of the agents execute motions according to the 
following rule. Initially, at round 1 the leftmost agent initiates the motion by moving to the 
point c satisfying 




p(z) dz = zi(l) + zi>(l). 



If this number c is higher than x 2 (l), then agent 2 also moves to point c, as well as all other 
agents whose positions at time 1 are less than c. This may be implemented in the straightforward 
way: when agent 1 reaches the location of agent 2, the two begin moving together. 



Generally, after agent i initiates the motion at round t, agent i + 1 initiates the motion at time 
t + 1. It moves to the position c which satisfies 



and every agent whose position is strictly less than c moves to position c alongside it. Every U 
steps, the leftmost agent initiates the motion once again and the process repeats itself. 

Remark: We do not consider the issue of collision avoidance in this paper, and thus the 
description of the algorithm above allows two agents to share the same location. It is possible to 
modify the dynamic control law so that no two agents are ever in the same place by introducing 
a tolerance e so that node i begins moving whenever node z — 1 is within e distance of it. For 
reasons of simplicity we will not be considering these issues here. 

While the update rule of the dynamic control law is somewhat involved, it has a simple 
interpretation. Consider the Markov chain of Figure 2 where we interpret the label on the edge 
as the probability of transitioning from i to j; note that the labels on the outgoing edges 
sum to 1 for every node. If Zi(t) is the probability of being at node i at time t, and z[(t) is 
the probability of being at node i' at time t, then the variables Zi(t),Zit(t) satisfy the above 
recursion. 

Indeed, this recursion is an adaptation of the "nonreversible Markov chain sampler" from 021. 
It was observed in that paper that while an ordinary "diffusive" Markov chain on the line graph 
which, say, moves to the right and left each with probability 1/2 takes on the order of n 2 steps to 
come close to the uniform distribution, a "guided" Markov chain which moves clockwise round 
the circle with probability 1 — 1/n and transitions to a symmetrically placed node with probability 
1/n gets close to the uniform distribution an order of magnitude faster. The dynamic coverage 
control law of this section is an attempt to harness this insight for the coverage problem. 

Our goal in this section is to prove that this iteration solves the coverage control problem and 
to demonstrate that it is an order of magnitude better as far as number of update rounds than 
the static control law of the previous section. The main result of this section is the following 
theorem. 




Theorem 6: Each Xi(t) has a limit, and the limiting set of positions has coverage $*. Moreover, 



after every agent has executed 0(n log ( Pmax " )) rounds of updates, each agent is within e of its 
final limit. 

Remark: We remark that while this is an order of magnitude improvement in number of update 
rounds, understood here as a step of sensing, communication, computation, and movement exe- 
cuted in parallel by all the nodes, it is not necessarily an improvement in speed or energy. Indeed, 
an update round in the dynamic control law includes communication between neighboring agents, 
while an update round in the static control law does not. Moreover, the time it takes to execute the 
motion in each round in the two algorithms is not immediately comparable; note that, assuming 
that each node can travel at unit speed, executing the motions takes max" =1 Xj(t + 1) — Xi(t) 
time in the first algorithm and Xj(t + 1) — Xj(t) in the second, where j is the index initiating 
movement at time t. However, since the two algorithms send agents to different positions, the 
movement times are not immediately comparable. Whether the dynamic algorithm is in fact 
faster in time or takes less energy will depend on the precise speed and energy requirements of 
sensing, communication, computation, and movement. Our main results, in Theorems 1 and 6, 
only demonstrate the weaker statement that the number of rounds in the dynamic algorithm is 
an order of magnitude better. 

Nevertheless, we note that under some additional assumptions, an improvement in the number 
of rounds does translate into an improvement in speed and energy. For example, if the time taken 
and energy expended are dominated by sensing and communication, and if the acts of sensing 
the field p at each step and exchanging messages with neighbors take similar expenditures of 
time and energy, then an order of magnitude improvement in the number of rounds immediately 
translates into an order of magnitude improvement in speed and energy. 

The general structure of our proof is similar the proof of the related results from lfT2ll . 
Specifically, we prove the theorem by first establishing a certain inequality on the state of the 
Markov chain of Figure 2, namely, Eq. ([8]); the proof of [TT2l proceeds in the same way. The 
meaning of Eq. ([8} is that after 0(n) steps, the state of the chain is not too far from uniform in a 
certain sense. Due to the somewhat asymmetric nature of our chain, our proof of this inequality 
is considerably more involved than in [TT2l and occupies the bulk of this section. Thus we begin 
with an extended series of lemmas whose goal is to prove Eq. ([8]); these lemmas essentially 
describe the action of the probability transition matrix in a slightly different way which is more 



convenient for the proof of Eq. ([8]). Once Eq. ([8]) is proved, Theorem 6 will follow by a standard 
argument in probability. 

We now proceed with our proof. Let z(t) denote the row vector ( z\(t) ... z n (t) z\> (t) ... z, 
Let K denote the matrix which maps z(t) to z(t + 1) through right-multiplication: 

z (t + 1) = z(t)K. 

Observe that K is a nonnegative, irreducible stochastic matrix. Standard results in Markov 
chain theory imply that the above iteration converges to a scaled multiple of the stationary 
probability vector of the chain. Thus we can immediately conclude that each Zi(t) has a limit, 
and consequently, the positions of the agents under the dynamic coverage control law have limits 
as well. 

Lemma 7: The stationary probability of the Markov chain in Figure 2 is 

1 

7T 1 = -Jiy = vr n = 7T n / = — 

An 

and for all other nodes, 

1 

2n 

This lemma can be proved by verifying that the stacked vector n satisfies tt t K = n T . 
This lemma implies that 

F(l) 

lxmzi(t) = lim^^j = limz n (t) = lini2: n /(t) = — ; — , (6) 

t t t t 

and 

F(l) 

]hnz i (t)=]hnz i ,(t) = -^-, (7) 
t t in 

which implies by Lemma 2 that the limiting set of positions do have optimal coverage $*. It 

remains to bound the time until the agents approach these positions. We begin with several 

definitions and lemmas which we make use of later. 

Definition: Let Aiyi\, . . . , njt) be the set of sequences of symbols Ai, A 2 , . . . , Ak in which the 
symbol A k appears n^ times. 



Lemma 8: Consider the following process of randomly generating elements from A(ni, . . . , njt). 
Begin with the sequence of A\ repeated n\ times; then place an A 2 at a uniformly random slot, 



where a slot is either between the letters already in place, or before the first or after the last 
letter; repeat the process n 2 times. Finally, perform the analogous operations for A 3 , . . . , A k , that 
is, place an A 3 at a uniformly random slot n 3 times, place an A 4 at a uniformly random slot n 4 
times, etc. Then every sequence in A(n 1: . . . ,n k ) has equal probability of being generated by 
this process. 

Example: The proof of this lemma will be made more intuitive by first considering an example. 
Consider the sequence AiA 2 A 2 A x A 2 E .4.(2,3). It may be generated with this process in 
exactly six sequences of insertions, shown below. The number before each sequence is the 
slot (enumerated from left to right) into which the next letter will be inserted. 

2.A 1 A 1 2.A 1 A 1 2.A 1 A 1 2.A 1 A 1 3.A X A X 3.A X A X 

2.A 1 A 2 A 1 2>.A X A 2 A X A.A X A 2 A X 4.A X A 2 A X 2.A X A X A 2 2.A 1 A 1 A 2 

h.A x A 2 A 2 A x h.A x A 2 A 2 A x 2.A 1 A 2 A 1 A 2 3.A X A 2 A X A 2 2.A 1 A 2 A 1 A 2 3.A X A 2 A X A 2 

A 1 A 2 A 2 A 1 A 2 A 1 A 2 A 2 A 1 A 2 A 1 A 2 A 2 A 1 A 2 A X A 2 A 2 A X A 2 A 1 A 2 A 2 A 1 A 2 A 1 A 2 A 2 A 1 A 2 

Note that each of the above sequences has probability (1/3) (1/4) (1/5) of being generated by 
the process of Lemma 8. 

Proof: Observe that the insertion of a symbol into a slot replaces it with two slots, so that every 

choice of slots has the same probability. It therefore suffices to show that any sequence can be 

generated in the same number of ways using the process described in this lemma. 

Consider the reverse process, i.e., the sequence of deletions beginning with the final sequence 
- first of A k 's, then of A k -i's, and so on - until we are left with the sequence of n x A\%. For 
example, the possible deletions of the sequence A 1 A 2 A 2 A 1 A 2 are 

h.A x A 2 A 2 A x A 2 h.A x A 2 A 2 A x A 2 2.A X A 2 A 2 A X A 2 3.A X A 2 A 2 A X A 2 2.A X A 2 A 2 A X A 2 3.A X A 2 A 2 A X A 2 

2.A X A 2 A 2 A X 3.A X A 2 A 2 A X A.A X A 2 A X A 2 4.A X A 2 A X A 2 2.A X A 2 A X A 2 2.A X A 2 A X 

2.A X A 2 A X 2.A X A 2 A X 2.A X A 2 A X 2.A X A 2 A X 3.A X A X A 2 3.A X A X A 2 

A X A X A X A X A X A X A X A X A X A X A X A X 

where the number next to the sequence is the index of the letter being deleted. As the above 

example illustrates, every sequence of insertions naturally corresponds to a sequence of deletions: 

if one can obtain sequence s 2 by insertion of a letter into slot j of sequence si, then one can 

obtain s x by deleting the j'th letter from s 2 . Thus, for any sequence, there are as many ways to 

generate it through the insertion process of this lemma as there to generate the sequence of A 1 

repeated n x times by this deletion process, and there are plainly n k \n k - X \ ■ ■ ■ n 2 \ ways to do the 



latter. Since the latter number does not depend on the particular sequence, this completes the 
proof, q.e.d. 

Definition: Consider the Markov chain obtained by adding a self-loop of probability 1/2 to 
nodes 1, 1', n, n' in Figure 2 and scaling the probabilities on the outgoing links from those nodes 
by a factor of 1/2. We refer to it as "the modified Markov chain" to distinguish it from the 
Markov chain of Figure 2, to which we henceforth refer as "the original Markov chain." The 
modified Markov chain is appropriately viewed as a certain random walk on a multigraph, as 
nodes 1' and n have two self-loops. 

Definition: Observe that every node in the modified Markov chain has three outgoing edges: a 
self-loop with probability 1/2, a link with probability 1/2 — \/{2U), and a link with probability 
1/{2U). We refer to the first as "major self loops," to the second as "continuing links," and to 
the third as "switching links." Note that the switching links originating at nodes 1' and n are 
self-loops. 

Lemma 9. Let T(a,j) be the event that starting at node a at time t = 1, the modified Markov 
chain satisfies the following conditions during times^ 1, . . . ,4n — 1: 

1) The chain has not taken major self-loops at nodes 1, l',n,n' . 

2) A single switching link was taken, at node j. 

Define L k be the event that k major self-loops were taken by time t = An. Then there is some 
constant d which does not depend on n and U such that 

P(T(a,j) I L k ) > — for all a,j and k < 2n - 2. 

c'n 

Proof: We first outline the general structure of the proof, which proceeds by repeated condi- 
tioning. Instead of conditioning on L k , we will instead condition on the specific set of k times 
when the modified chain took a self-loop (we call this L(An)); specifically, we argue for any 
L{An) there is a constant probability that exactly one switching link was taken by time An. 
We then argue that conditional on L(An) and a single switching link being taken by time An, 
the probability that that link was taken at node j and that no self-loops were taken at nodes 



'Henceforth, we will shorten this statement as "by time 4n." 



1,1', n, n' is proportional to 1/n (this last step crucially relies on Lemma 8). This proves the 
current lemma. 

We now proceed with the proof. As just mentioned, we define L{An) to be the set of times in 
{1, . . . , 4n — 1} when the modified Markov chain takes a major self-loop. Because the distribution 
of the number of major self-loops taken by the modified chain is exactly that of the times a 
fair coin lands on heads in An — 1 tosses, we have that conditional on L k , L(An) is uniformly 
distributed over all /c-element subsets of {1, . . . , An— 1}. Similarly, conditional on any A;-element 
L(An), the distribution of the number of switching links is exactly that of the number of times 
a coin whose heads probability is l/U lands on heads exactly once in An — 1 — k tosses. So, 
if S is the event that a single switching link was taken by time t = An, then for any /c-element 
L(An), 

P(S\L(An)) = (4n - 1 - fc)±(l - ^ M > |((1 - If )^ > 2c{\)^, 

where we used the bound k < 2n — 2 and that (1 — l/x) x is increasing for x > 1 while U > 3. 
It follows that conditional on L k , S occurs with constant probability independent of n, U. 

Moreover, conditional on both L k and S, the sequence of continuing links, switching links, 
and major self-loops has uniform distribution over the set of sequences with k major self-loops, 
a single switching link, and An — 2 — k continuing links. By Lemma 8 we may generate this 
sequence by writing down An — 2 — k C's (corresponding to continuing links), inserting an 
S (corresponding to a switching link) in a uniformly random slot, and then inserting k L's 
(corresponding to major self-loops) in uniformly random slots. To conclude the proof of the 
lemma, we next argue that for any node j, with probability c"/n, (where c" is some constant 
independent of n, U) this process produces a sequence which takes a switching link at node j 
and avoids self-loops at nodes 1, V ,n,n' . 

Indeed, there are An — 2 — k > 2n continuing links (where we used that k <2n — 2), so every 

node is visited at least once, and the probability of inserting a switching link in the corresponding 

slot is at least l/(4n). Moreover, because there are at most An continuing links and at most one 

switching link, the number of times the chain can enter the sets {n', 1} and {n, 1'} is at most 5. 

Thus we avoid creating a major self-loop at 1, l',n,n' if and only if we avoid at most ten slots 

every time we insert an L into a random slot. When n > 6, the probability of this is at least 

10 10 , 1 .,. 



To summarize, for n > 6, the probability of placing a link at j and avoiding major self-loops 
at nodes 1, l',n,n' conditional on L k , S is at least (l/(4n))(l/6) 24 . This proves the lemma for 
n > 6, and it holds automatically for n = 3, 4, 5 (because for these n, the probability in question 
is positive, and so can be assumed to be lower bounded by a constant which can be taken to be 
independent of n, U). q.e.d. 

Lemma 10: Let T(a, b) be the event that: 

i. The modified chain starts at node a at time t = 1 and is at node b at time t = An. 

ii. The modified chain has not taken the self loops at nodes 1, l',n,n' before time t = An. 
If k is at most 2n — 2 and has the same parity as b — a, then there is some j (depending on k) 
such that 

T(a,j)DL k GT(a,b)r)L k . 

Proof: In words, this lemma says that conditional on having taken a certain number of self-loops 
which is at most 2n — 2 and has the right parity, T(a, b) contains some T(a,j). To prove this, 
label the nodes of the modified Markov chain 1, . . . , 2n counterclockwise, and let p(t) be the 
label of the state of the modified Markov chain at time t taken mod 2n. This simply means that 
we will refer to state 2n as state zero, but the advantage of such a labeling is that some facts 
become easier to state in terms of addition mod 2n. In particular, a major self loop does not 
change p(t), a continuing link adds 1 to p(t), and a switching link taken at node n + z adds 
-2z to p{t). 

With that in mind, we argue that a chain beginning at node a and by time An taking k major 
self-loops, one switching link at node n + z, where z is specially chosen, and continuing links 
for the remainder of the time will be located at b at time t = An. This will prove the lemma. 

Indeed, putting off the choice of z for the moment, we have that 

p(An) = a + (An - 1 - 1 - k) - 2z. 
Thus p(An) = b if and only if 

An - 2 - k - 2z = b - a. 

Because k has the same parity as b — a, this equation has a solution and so a satisfactory z can 
always be found, q.e.d. 



Lemma 11: 

P(T(a, b)) > — for any a, b, 
d'n 

where c" > is a constant which does not depend on n and U. 

Proof: If b — a is even, 

P(T(a,b)) > P(A; self loops taken)P(T(a, b) \ k self loops taken) 

k even, k<2n— 2 

> P(fc self loops taken)P(T(a, j) \ k self loops taken) 

k even, k<2n— 2 



> — — ^ P(A; self loops taken) 



en 

k even, k<2n— 2 

1 

> . 

where the transition between the first and second line used Lemma 10 and the transition between 
the second and third used Lemma 9. The case when b — a is odd is similar, q.e.d. 

With the last lemma in place, we are finally able to turn to the analysis of the original Markov 
chain from Figure 2. We find it convenient to use the largest I 1 distance between the rows of 
K l and their ultimate limit as a measure of convergence. In particular, let [K l )i refer to the i'th 
row of K l , and let 

v(t) = max IK-fT*^ - 7r||i. 

i 

We use t(n, e) to denote the time until v(t) permanently sinks below e. 
Lemma 12 t(n,e) = 0(n log 1/e). 

Proof: By Lemma 11, the modified Markov chain starting from node a at time t = 1 has a 
1 / (d'n) chance of being at any node b at time t = An and avoiding the major self-loops at nodes 
1, 1', n, n' while getting there. But all choices of links satisfying these two conditions have a 
higher probability in the original Markov chain which has no major self-loops at 1,1', n,n'. 
Consequently, if s(t) is the state of the original Markov chain at time t, then 

P(sUn) = b I s(l) = a) > — for all a, b, n, (8) 

d'n 

where c" is the same constant as in Lemma 11. It is not hard to see (and was shown in |[T2lO 
that Eq. ([8]) implies the conclusion of this lemma, q.e.d 



Lemma 13 After 0(nlog(nF(l)/e)) steps, we will always have 

\ Zi (t)-^F{l)\<^-. 
Proof: We show that after 0(n \og(n/e')) iterations, the inequality 

\ Zi (t)- *iF(l)\ < ^F(l) (9) 

is satisfied. Taking e' = e/F(l) in this statement yields the lemma. 

To prove Eq. ([9]), observe first that scaling the density p multiplies both sides by the same 
number so that we may assume without loss of generality that F(l) — 1. In this case, Zi(t) is the 
probability that the random walk that starts at node k with probability Zfc(O) is at node i at time 
t. This is a convex combination of the entries of the i'th column of K*. By the previous lemma, 
each entry of the z'th entry of each row is not more than e'/(2n) from i\i after 0(nlog(n/e')) 
steps, and the convex combination of these entries must have the same property, q.e.d. 

Proof of Theorem 6. Since we have already shown that the positions of the agents have limits 

which achieve optimal coverage, all that needs to be argued is that every agent is within e of its 

final position after 0(n log(p max n/(p min e))) rounds. 

Since the physical locations of the agents throughout the execution of the dynamic control law 

do not change if we scale the density p, we may assume that our density is p'(z) = p(z)/p m i n . 

Note that F'(l) = J Q p'(z)dz satisfies F'(l) < p meLX / p mm . We now apply the previous lemma 

to the density p', and obtain that after 0(n log(p max n/(p min e))) updates, we will always have 

/x / * F(l). e 
An n 

\z i (t) + z i ,(t)- F ^-\ < -, i = 2,...,n-l 
n n 

. v , , F(l). e 
in n 

which combined with Eq. (j7]) and Eq. ^ implies that each Zi(t) + Zi>(t) is always within e/n 
of its final position after t\ = 0(n log(p max ra/(p min e))) iterations. 

Denote the limiting position of agent i by ~xl and the limiting value of Zi(t) + Zi'(t) by ~zi. At 
some time t[ G [ti,ti + U] agent 1 is going to initiatate motion. Then at t = t[ + 1, 

p'{u)du\ = \z\(t — 1) + Zi'(t — 1) — Zi\ < 

n 



xi(t) Jo Jo 



p'(u)dz\ = | / p'{u)du — I p'{u)du\ = \zi(t — 1) + zy{t — 1) — Z\\ < 



and since p'(z) > 1, this implies in particular that \xi(t) — x~[\ < e/n. Moreover, this remains 
true for all t after t[ + 1; indeed, it does not change when agent 1 is not initiating motion, and 
when it is by exactly the above chain of inequalities. 
We next argue that for alH > t x + i, we have 

ie 



f 

J Xi 



p'(u)du\ < 



n 



>Xi(t) 

which similarly implies \xi(t) — x~\ < ie/n. We proceed by induction; indeed, we have already 
proven the base case of % — 1 and now assuming we have proven this for i = I, we have that at 

time t[ + I + 1, 

rxl+i r x i(t) 



fXl(t) PXl f x l+l 

p'(u)du\ = I / p'(u)du+ / p'{u)du+ I p'(u)d 

Jx l+1 (t) Jx t (i) JxJ 

pxi r x i+i(t) r x i+i 

< I / p'(u)du\ + \— p'{u)du + / p'{u)du\ 

J x t (t) J xi(t) JxJ 

< - + \-(z i {t) + z i ,{t)) + zi 



< 



n 

(1 + 1)6 



n 

Moreover, this remains true for all t > t[ + i: the inequality is unchanged when agent % is not 
initiating motions, and holds by exactly the same sequence of inequalities when it is. 

Thus after t\ + 2U steps, every agent is always at most e away from its limiting location. 
Since U = 0(n), this proves the theorem, q.e.d. 

Remark: We remark that it is possible to describe a modification of the dynamic control law that 
is robust to unpredictable deletion and addition of agents. We informally sketch this modification 
here; it is based on the observation that the dynamic control law converges to the correct solution 
regardless of initial conditions as long as XliLi-^W + z i'{ty nas the correct value of -F(l). 
Consequently, it suffices to describe a protocol which preserves the sum Y^i=i z i(t) + z i'(t) m 
the event of agent addition and deletion. 

It is easy to deal with the addition of an agent: the new agent simply sets its variables z, h z^ 
to zero, thus preserving the sum. On the other hand, if agent % is deleted we must rely on the 
measurements of neighboring agents to recover zi + z v . However, observe that the rules of our 
control law imply that there is a relation between the sum Zi + and the location of agent 
i. Indeed, agent i's location at the end of update round t is Zi(t) + Zi>(t) to the right of the 



location of agent i — 1. Thus the neighbors of agent i can reconstruct + Zj'(t) after the 
unpredictable deletion of agent i. Once this is completed, the neighboring agents can increase 
their own variables in any way that preserves the sum Y^i=i z i(t) + z i'{t)- 

A similar technique may be used to deal with changes in density; if the agents can detect these 
changes and increase or decrease their own variables Zj, Zy in response, then as long as the number 
of agents and the density eventually stablize, a suitable modification of the dynamic control law 
will converge to the correct answer. An interesting open problem is to prove a dynamic guarantee 
on performance if the number of agents or the density is continually changing. 

V. SIMULATIONS 

We report here on several simulations of our coverage control laws. We are able to observe 
that quite often the performance is considerably better than the theoretical upper bounds derived 
in this paper, and that the dynamic control law of Section IV gives a considerable practical 
speedup over the static control law of Section III. 
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Fig. 3. Left figure displays the progress of both the static (red) and dynamic (blue) coverage control laws in the uniform 
density p(u) — 1. In both cases, the initial condition consists of n = 15 nodes with the same randomly generated positions. 
The iteration number t is represented on the x-axis, and the distribution of agents in [0, 1] on the y-axis. The right figure shows 
the number of agents on the x-axis, and the number of iterations until agents are close to their final values on the y-axis. Initial 
conditions are random as well, and each point on the right graph is an average of 40 runs. 



Figure 3 shows the results from a simulation with random initial conditions. In this case, Xi(0) 
is the i'th largest value of n random variables all uniform on [0, 1]. The density p is uniform on 



[0, 1]. Moreover, we assumed that each agent knows the total number of agents in the system, 
i.e. U = n. The left figure shows some snapshots from the progress of both algorithms, while the 
right figure shows the time until the stopping condition Y^i=i( x i(t)~ ^ m t x i{t)) 2 < 10~ 4 holds for 
the first time. The results show reasonably quick convergence for the random initial conditions. 
We see that for the systems shown in the figure, the static control law has a convergence time 
which grows slower than the quadratic growth proved in Theorem 1, while the dynamic control 
law has convergence time which appears to grow somewhat slower than the linear upper bound 
of Theorem 6. 




Iteration number Number of Agents 



Fig. 4. Left figure displays the progress of both the static (red) and dynamic (blue) coverage control laws in the uniform 
density field p(u) — 1. In both cases, the initial conditions consist of n = 15 nodes all of which start with a;j(0) = 1. The 
iteration number t is represented on the x-axis, and the distribution of agents in [0, 1] on the y-axis. The right figure shows the 
number of agents on the x-axis, and the number of iterations until agents are close to their final values on the y-axis; just as in 
the left figure, every node starts out with Xi(Q) = 1. 

On the other hand, it is not hard to find examples for which the bounds on growth rates from 
Theorems 1 and 6 do occur. In Figure 4, we see the performance when every agent starts with 
Xj(0) = 1; every other aspect is the same as the simulation in Figure 3. We see that in this case 
the convergence times do seem to grow quadratically in n for the static control law, and linearly 
in n for the dynamic control law. 

Intuitively, the issue of convergence rate seems to be related to how fast information propagates 
through the network of nodes. The high weights used in the dynamic control law - specifically, 



the weights 1 — 1/U, which are quite close to 1 - allow for fast diffusion of information if an 
agent is far from where it should be. On the other hand, in the static control law information 
diffuses through nearest neighbor interactions, and this process can be an order of magnitude 
slower. 
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Fig. 5. The left figure displays the progress of both the static (red) and dynamic (blue) coverage control laws in the nonuniform 
density field p(x) — x 2 . The initial conditions consist of n = 15 nodes all of which start with Xi(Q) = 0. The iteration number 
t is represented on the x-axis, and the distribution of agents in [0, 1] on the y-axis. The right figure shows the number of agents 
on the x-axis, and the number of iterations until agents are close to their final values on the y-axis; just as in the left figure, 
every node starts out with Xi(0) = 0. 



Finally, we describe an example in which the agents achieve optimal coverage with respect to 
a nonuniform metric. Figure 5 shows a simulation with density p{x) = x 2 , which requires the 
nodes to concentrate themselves closer to the boundary of 1 of the interval [0,1]. The starting 
point was chosen to be Xj(0) = for i = 1, . . . ,n, in part because it is a difficult starting 
point for the coverage algorithms we have presented here: the agents will have to move right 
one-by-one. Convergence times seem to be somewhat larger here than in the uniform density 
case, and the gap between the convergence times of the dynamic and static algorithms seems to 
be more pronounced. The convergence times shown in the graphs are consistent with the bounds 
of Theorems 1 and 6. The dynamic control law exhibits some interesting dynamics, first moving 
many agents too close before pushing them apart to the optimal configuration. It is of interest 
to understand the non-asymptotic dynamics of this control law better. 



VI. CONCLUSIONS 

We have investigated distributed control laws for mobile, autonomous agents to position 
themselves on the line for optimal coverage in a nonuniform field. Our main results are stated 
in Theorems 1 and 6. Theorem 1 gives a quantitative upper bound on the convergence time of a 
relatively simple control law for coverage. Theorem 6 discusses a dynamic control law which, 
while making stronger assumptions on the capabilities of each agent, manages to accomplish 
the coverage task with an order of magntude fewer sensing/communication/movement iterations 
in the worst case. 

Our work suggests a number of open questions. It is of interest to understand whether 
the increased capabilities of the agents in Section 4 are really necessary to achieve better 
performance. In addition, it would be interesting to explore whether the results described here 
extend to two and higher dimensions, and in particular, whether a dynamic control law such as 
the one in Theorem 6 might be useful for speeding up performance in more general settings. 
Generalizing these results to higher dimensions is challenging because the notions of a "leftmost" 
and "rightmost" agent does not make sense in two dimensions; moreover, our algorithm crucially 
relies on the transformation x — > J* p(z) dz which "stretches" the domain to make it uniform; it 
is not immediately clear how to compute the corresponding transformation in two dimensions in 
a distributed way. Finally, an open problem is to replace the assumption that nodes are capable 
of computing integrals of the density p with the weaker assumption that noisy samples of p are 
available to the nodes. 
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